Evaluation of Features for Audio-to-Audio Alignment
Abstract
Audio-to-audio alignment is the task of synchronizing in time two audio sequences with similar musical content. We investigated a large set of audio features for this task. The features were chosen to represent four content-dependent similarity categories: envelope, timbre, note onsets, and pitch. The features were processed in two stages. First, a feature subset was selected by evaluating the alignment performance of each individual feature. Second, the selected features were combined and subjected to an automatic weighting algorithm. We propose a new method for the objective evaluation of audio-to-audio alignment systems that enables the use of arbitrary kinds of music as ground truth data. We evaluated our algorithm with this method as well as on a data set of real recordings of solo piano music. The results showed that the feature weighting algorithm could improve alignment accuracy compared with the results of the individual features.
Similar resources
Criteria for the Evaluation and Production of Audio Books from the Producers' Point of View: A Qualitative Content Analysis
Purpose: Audio books have a special standing in the publishing industry. Publishers around the world produce audio books according to different criteria and standards. This study aimed to identify and introduce the most important criteria for the evaluation and production of audio books from the producers' point of view. Methodology: This study was performed with qualitative content analysis of interview...
The Effect of Gloss Type and Mode on Iranian EFL Learners’ Vocabulary Acquisition
Vocabulary is an important component of language proficiency which provides the basis for learners’ performance in other skills. But, since vocabulary learning seems to be so demanding, learners tend to forget newly-learnt words quite soon. In order to identify vocabulary learning conditions which can produce a more lasting effect, this study investigated the effect of three kinds of gloss cond...
Audio Steganalysis Based on Inter-Frame Correlation and Recursive Feature Reduction
Dramatic changes in digital communication and in the exchange of image, audio, video, and text files have created fertile ground for the interpersonal transfer of hidden information. Consequently, preserving channel security and intellectual property, and gaining access to hidden information, have given rise to new fields of research, namely steganography, watermarking, and steganalysis. Steganalysis as a binary classifica...
Audio-to-audio Alignment using Particle filters to Handle Small and Large Scale Performance Discrepancies
We present an approach to improve the audio-to-audio alignment performance of causal alignment systems in the presence of either note-level or sectional performance differences. We explore the use of particle filter based models tailored specifically for online audio-to-audio alignment with a focus on handling missing sections in the audio to be aligned. The proposed approach relaxes the local ...
Comparing the Impact of Audio-Visual Input Enhancement on Collocation Learning in Traditional and Mobile Learning Contexts
This study investigated the impact of audio-visual input enhancement teaching techniques on improving English as a Foreign Language (EFL) learners' collocation learning as well as their accuracy concerning collocation use in narrative writing. In addition, it compared the impact and efficiency of audio-visual input enhancement in two learning contexts, namely traditional and mo...
Publication date: 2011